Protein O-Sulfonation 

Inventors: Alma L. Burlingame, Katalin F. Medzihradszky, Zsuzsanna Darula, Eran Perlson, 
Michael Fainzilber, Robert J. Chalkley, Darren Tyson and Ralph A. Bradshaw 

This work was supported by NIH NCRR Grant, RR 01614. The U.S. government 
may have rights in any patent issuing on this application. 

INTRODUCTION 

Field of the Invention 

The field of the invention is detecting and modulating O-sulfonation of serine and threonine 
residues on polypeptides. 

Background of the Invention 

Sulfonation occurs as a common enzymatic modification of endogenous substances 
including proteins, carbohydrates, catecholamines, and estrogenic steroids as well as 
xenobiotic chemicals [1]. Sulfonation refers to the transfer of the sulfonate group (SO3"*) 
from 3'-phosphoadenosine-5'-phosphosulfate (PAPS), the only known sulfonate donor [2]. 
Sulfonation can occur through several types of linkages, esters (O-sulfonation), amides (N- 
sulfonation) and thioesters (S-sulfonation) [3], of which O-sulfonation is the most prominent. 
The transfer of SOj'^ to a hydroxyl acceptor (O-sulfonation) generates an ester sulfate, and 
this reaction has commonly been referred to as sulfation rather than the more accurate O- 
sulfonation. 

The majority of cellular sulfonation is O-sulfonation and occurs primarily on steroids, 
catecholamines and thyroid hormones [1]. The sulfonation of these molecules is catalyzed by 
the soluble cytosolic sulfotransferases and appears to alter their bioactivity. For example, 
estrogen, testosterone and thyroid hormones (T3 and T4) can interact with their respective 
receptors to regulate transcription whereas their sulfate-containing moieties cannot. 
Furthermore, the half-life of these compounds in blood is significantly shorter than that of 
their conjugated counterparts suggesting that sulfonation maintains these compoimds in an 
inactive state ready for rapid deployment by the removal of the sulfonyl group. 

While the c>losolic sulfotransferases conjugate cell-permeable or intracellular 
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compounds, the membrane-bound Golgi-associated sulfotransferases are primarily 
responsible for sulfonation of extracellular proteins via a co- or post-translational 
mechanism. The membrane-bound sulfotransferases are responsible for the sulfonation of 
various glycosaminoglycans such as heparin and heparan sulfate. Additionally, such enzymes 
catalyze the direct sulfonation of proteins on the 4-0-position of tyrosine residues [4]. It is 
one of the last modifications to occur during protein transiting the trans-Golgi and thus has 
been found almost exclusively on secreted and plasma membrane proteins of all metazoan 
species examined. In addition, there is a large body of evidence that this modification is 
present usually at the interface of interacting proteins and hence, is known to modulate 
extracellular protein-protein interactions. In humans, protein tyrosine sulfonation has been 
implicated in proteins of the vasculature and hemostasis. Examples include the mediation of 
inflammatory leukocyte adhesion, chemokine receptors and modulation of the blood 
coagulation cascade [5]. Significantly, only tyrosine residues have been described as sites for 
O-sulfonation within proteins, and O-sulfonation of proteins has not previously been shown 
to occur within the cytosol. Several tyrosyl protein sulfotransferases [6,7] and arylsulfatases 
[8] present in the trans-Golgi have been described, but unlike tyrosine 
phosphorylation/dephosphorylation [9], there is no evidence of dynamic regulation of 
tyrosine sulfonation [4,5]. Until now, only widespread modification of tyrosine has been 
observed [10,11]. 

We have discovered the occurrence of sulfonation as a posttranslational modification 
of serine and threonine residues, and that this sulfonation is involved in the modulation of 
protein-protein interactions, and in particular, has regulatory fimctions in receptor tyrosine 
kinase signaling as discussed fiirther below. We have exploited this finding to develop 
methods of detecting and modulating this serine and threonine sulfonation. 

SUMMARY OF THE INVENTION 
The invention provides methods and compositions for detecting and/or modulating O- 
sulfonation of serine or threonine residues on cell-made polypeptides. In one embodiment, 
the invention provides a method of detecting a post-translational protein modification, 
comprising the step of: specifically detecting O-sulfonation of a serine or threonine residue of 
the protein. This method niay be practiced comparatively, wherein the detected O- 
sulfonation is detected under a first physiological condition, and is compared with a control 
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0-sulfonation detected under a second physiological condition, particularly wherein a 
difference between the detected and control O-sulfonations indicates a difference between the 
first and second physiological conditions. The detected and control O-sulfonations may be 
detected directly, indirectly (e.g. by specifically detecting a biochemical marker 
predetermined to be specifically correlated with the change in O-sulfonation) or inferentially 
(e.g. by specifically detecting a physiological change predetermined to be specifically 
correlated with the change in O-sulfonation). 

Also disclosed are diagnostic reagents specific for O-sulfonated proteins, particularly 
antibodies which specifically bind incorporated O-suIfonated serine or threonine residues. 
Accordingly, the invention provides an isolated antibody which specifically binds an O- 
sulfonated protein, wherein the specific binding is dependent on the presence of an O- 
sulfonated serine or threonine residue in the protein. 

In another embodiment, specific, predetermined changes in physiological conditions 
(includmg biochemical signaling) are used to infer specific changes in O-sulfonation. For 
example, this embodiment includes a method of detecting a change in O-sulfonation of a 
serine or threonine residue of a protein, comprising the step of specifically detecting a 
physiological change predetermined to be correlated with the change in O-sulfonation. 

hi another embodiment, the invention provides a method of detecting a serine or 
threonine protein sulfotransferase or sulfatase activity, comprising the step of: specifically 
detecting a change in O-sulfonation of a serine or threonine residue of a protein, wherein the 
change in O-sulfonation indicates the sulfotransferase or sulfatase activity. 

In another embodiment, the invention provides methods of modifying proteins by 
introducing a predetermined change in O-sulfonation at a serine or threonine residue of the 
protein, and optionally, detecting a resultant change in O-sulfonation, for example, by 
specifically detecting a physiological change predetermined to be correlated with the change 
in O-sulfonation. These methods include introducing or increasing O-sulfonation at the serine 
or threonine of the protein; eliminating or reducing O-sulfonation at the serine or threonine of 
the protein; and derivatizing or substituting O-sulfonation at the serine or threonine of the 
protein. 

In another embodiment, the invention provides kits comprising a reagent for use in a 
subject method, and optionally, an instructional medium describing a subject method. The 
invention also provides business methods specifically adapted to, and/or incorporating a 
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description of, or reference to a subject method or kit. 

DETAILED DESCRIPTION OF PARTICULAR EMBODIMENTS OF THE INVENTION 
The following descriptions of particular embodiments and examples are offered by 
way of illustration and not by way of limitation. Unless contraindicated or noted otherwise, 
in these descriptions and throughout this specification, the terms "a" and "an" mean one or 
more, the term "or" means and/or. 

Protein sulfonation on serine and threonine residues is a natural post-translational 
modification of numerous and diverse proteins, particularly in eukaryotic organisms. Serine 
and threonine sulfonation is used to regulate a broad and diverse array of protein functions, 
including assembly and signal transduction. This invention provides methods and 
compositions for detecting and/or modulating natural and artificial O-sulfonation of serine or 
threonine residues on proteins. The methods are applicable to an enormous diversity of 
natural and non-natural proteins, including intracellular, membrane-bound, and extracellular 
proteins, wherein the term protein generically encompasses biomolecules comprising a 
peptide-bond linked amino acid polymer (i.e. peptide or polypeptide) made by a cell, and 
includes such polymers subject to post-translation modifications, including glycosylation, 
and joined to other biomolecules (e.g. to form proteo-lipids), or incorporated into larger 
macromolecules. Preferred target proteins are human proteins, particularly human blood 
proteins. 

Described below are diverse exemplary methods of using and commercially 
exploiting O-sulfonation of the serine or threonine residue of the protein. Of course, the 
invention includes existent methods and uses relating to tyrosine sulfonation and 
phosphorylation and glycosylation of serine and threonine residues, as adapted to sulfonated 
serine and threonine residues; e.g. Bettelheim, J Am Chem Soc, 1954, 75, 2838-2839; Hiltz 
et al. PNAS 1955, 41, 880; Leyh, Revs Biochem and Mol Biol., 1993, 28, 515-542. 

O-sulfonation of the serine or threonine residue of the protein may be specifically 
detected by any convenient analytical technique, including specific chemical reagents, such 
as antibodies specific to the modified residues, chemical analysis, which may be performed 
directly or by derivatizing the O-sulfonated groups and detecting the resultant product, 
radiolabeling, such as with ^^S04, and mass fragmentation analysis by mass spectrometry. 

For sensitive detection, high performance hybrid triple quadrupole/linear ion trap 
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mass spectrometers are particularly amenable. For example, the recent ABI/Sciex 
QTRAP4000 (Applied Biosystems, Foster City, CA) detects and selectively sequences 
phospho- versus sulfo peptides via observation of m/z = -79 vs -80 during a HPLC run of a 
protein digest. Another instrument, the Thermo Finnigan (San Jose, CA) LTQ FT-ICR ESI 
System is able to measure masses accurately within 1 ppm - revealing differences in accurate 
mass between HP-(30.97376+ 1.007825) and S-(3 1.97207) nominal mass isobars. Equipped 
with electron capture dissociation (ECD) that selectively cleaves peptide backbone bonds and 
leaves weaker, chemically labile posttranslational bonds intact, this system accelerates the 
site specific localization of phospho versus sulfo Ser/Thr and O-GlcNAc. 

Detection reagents comprising 0-sulfonated serine and threonine-specific moieties 
include somatically recombined polypeptide receptors Uke specific antibodies or T-cell 
antigen receptors (see, e.g Harlow and Lane (1999) Using Antibodies, A Laboratory Manual, 
Cold Spring Harbor Laboratory Press), intracellular binding agents identified with assays 
such as one-, two- and three-hybrid screens, non-natural intracellular binding agents 
identified in binding assays of chemical hbraries, etc. In particular, protocols for generating 
specific monoclonal and polyclonal antibodies are well-known, including human, humanized, 
and hybrid human-murine antibodies. In general, sulfopeptide-specific antibodies may be 
made by covalently coupling the sulfopeptide to a complex antigen, and emulsifying the 
conjugate in an adjuvant, such as Freunds complete adjuvant. Laboratory animals, such as 
rabbits are immunized according to conventional protocol and bled. Alternatively, protocols 
for in vitro immunization are also well known. The presence of specific antibodies may be 
assayed by convenient solid phase immunosorbant assays using immobilized sulfo-peptides, 
and to ensure specificity, the corresponding negative control non-sulfonated peptide. 
Positive and negative affinity purification is readily effected using immobilized sulfo- 
peptides and negative control peptides. 

Protocols for generating antibodies to poorly immunogenic epitopes are well known 
in the art; e.g. the use of antigen-selected combinatorial phage antibody libraries (e.g. 
Williamson et al. 1998, J Virol 72, 9413-18). In particular protocols designed to maximize 
yield of sulfopeptide-specific antibodies, we conjugate sulfopeptides having lengths between 
5 and 15 residues, at >90-96% purity, immunize at least 3 animals, and affinity purify the 
antibodies over both the non-sulfopeptide and sulfopeptide-Sepharose. Using these criteria., 
we describe how to generate a panel of O-linked serine- and threonine-sulfonated peptide 
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specific monoclonal antibodies using a panel of corresponding O-linked serine- and 
threonine-sulfonated peptides conjugated to keyhole limpet hemocyanine (KLH); see 
Examples, below. Hence, also disclosed are diagnostic antibody reagents which 
specifically bind incorporated O-sulfonated serine or threonine residues. Accordingly, the 
invention provides an isolated antibody which specifically binds an O-sulfonated protein, 
wherein the specific binding is dependent on the presence of an O-sulfonated serine or 
threonine residue in the protein. These specific binding agents may be modified or 
derivatized, for example with a label, to provide convenient reporters. 

The methods encompass detecting a plurality of different and often predetermined, 
different serine and threonine sulfonations, which may be on one or more proteins. Specific 
sulfonation pattems are associated with particular proteins, and particular conditions. Hence, 
serine and threonine sulfonation, and pattems thereof, are used to characterize proteins and 
physiological conditions, and vice versa. In particular, specific, predetermined changes in 
physiological conditions (including biochemical signaling) are used to infer specific changes 
in O-sulfonation. For example, this embodiment includes a method of detecting a change in 
0-sulfonation of a serine or threonine residue of a protein, comprising the step of specifically 
detecting a physiological change predetermined to be correlated with the change in O- 
sulfonation. 

Detection methods may be practiced comparatively, wherein the detected O- 
sulfonation is detected under a first physiological condition, and is compared with a control 
O-sulfonation detected under a second physiological condition, and a difference between the 
detected and control O-sulfonations indicates a difference between the first and second 
physiological conditions. Hence, a physiological change, preferably predetermined, is 
detected inferentially by detecting a change in O-sulfonation of a serine or threonine residue 
of a protein, also preferably predetermined, wherein the change in O-sulfonation of the serine 
or threonine residue of the protein indicates the physiological change. Analogously, the 
invention provides methods for detecting a change in O-sulfonation of a serine or threonine 
residue of a protein by detecting a physiological change predetermined to be correlated with 
the change in O-sulfonation. 

hi natural cells, numerous serine and threonine protein sulfotransferases and 
sulfatases effect serine and threonine O-sulfonation and desulfonation. Additionally, O- 
sulfonated serine and threonine residues mediate their functionaUties by participating in a 
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wide variety of interaiolecular interactions with natural, specific interacting molecules, 
particularly proteins. These natural interacting molecules, particularly proteins, and 
particularly sulfotransferases and sulfatases, are readily detected and isolated using O- 
sulfonated serine- and threonine containing peptides and proteins in established affinity 
strategies, including affinity chromatography, two-hybrid screening, affinity panning, etc. 
These sulfotransferases and sulfatases may also be detected by specifically detecting a 
change in O-sulfonation of a serine or threonine residue of a protein, particularly a 
predetermined change predetermined to be correlated with a predetermined sulfotransferase 
or sulfatase, wherein the change in O-sulfonation indicates the sulfotransferase or sulfatase 
activity. 

Natural, specific interacting molecules, and derivatives, including deletion mutants 
thereof which retain binding specificity, provide specific probes for detecting incorporated 
O-sulfonated serine or threonine residues. Hence, also disclosed are diagnostic reagents 
specific for O-sulfonated proteins, in addition to antibodies, which specifically bind 
incorporated O-sulfonated serine or threonine residues. As with specific antibodies, these 
specific binding agents may be modified or derivatized, for example with a label, to provide 
convenient reporters. 

Li another embodiment, the invention provides methods of modifying proteins by 
introducing a preferably predetermined change in O-sulfonation at a serine or threonine 
residue of a preferably predetermined protein, and optionally, detecting a resultant change in 
O-sulfonation, for example, by specifically detecting a physiological change predetermined 
to be correlated with the change in O-sulfonation. The modification may be made inside or 
outside the cell expressing the protein. These methods include introducing or increasing O- 
sulfonation at the serine or threonine of the protein; eliminating or reducing O-sulfonation at 
the serine or threonine of the protein; and derivatizing or substituting O-sulfonation at the 
serine or threonine of the protein. Generally, the methods are practiced by directly or 
indirectly activating or inhibiting a preferably specific serine/threonine sulfatase or 
sulfotransferase. For example, eliminating or reducing O-sulfonation at the serine or 
threonine of the protein may be effected with a serine/threonine sulfotransferase inhibitor 
(e.g. Armstrong, et al. Curr. Opin. Drug Disc. Dev., 2000, 3, 502-515), such as a 
carbohydrate sulfotransferase inhibitor (e.g. Armstrong et al., Chem. hit. Ed. Engl. 39, 1303- 
1306, 2000), a purine sulfotransferase inhibitor (e.g. Verdugo, et al.. J. Med. Chem., 2001, 
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44, 2683-2686), a bisubstrate sulfotransferase inhibitor (e.g. Armstrong, et al. Org. Lett, 
2001, 3, 2657-2660; Armstrong et al. J Org Chem. 2003, 68, 170-3) and combinatorial target- 
guided ligand assembly generated sulfotransferase inhibitors (e.g. Kehoe et al., Bioorg Med 
Chem Lett. 2002, 12, 329-332). 
5 In this regard, we disclose cross-talk between several post-translational 

serine/threonine O-linked modifications, including O-linked glycosylation (e.g. GlcNAc), O- 
linked sulfonation, and O-linked phosphorylation. Since these alternative modifications can 
occupy the same serines and threonines, they can be interdependently regulated. For 
example, selective inhibition of O-linked glycosylation and phosphorylation can be used to 
10 promote O-linked sulfonation at the same site. Methods and reagents for regulating O-linked 
phosphatases and kinases are well known in the art, as are analogous methods for regulating 
O-linked carbohydrate transferases and sulfatases (e.g. Winans et al. Chem Biol. 2002, 9, 
113-129; Grunwell et al. Biochem. 2002, 41, 13117-13126) and O-linked 
phosphotransferases. 

15 Methods for inducing beta elimination and Michael addition to O-GlcNAc sites are 

well-established (Vosseler et al., 2002, Curr Opn Chem Biol 6, 851-7), and analogous 
protocols are established for modifying phosphorylation sites (e.g. Knight et al., Nat 
Biotechnol. 2003 Aug 17, Epub ahead of print). Hence, O-GlcNAc modifications on serine 
and threonine, and serine-phosphate may be beta eliminated, and then modified by any 

20 selected Michael reagent. Selectivity of the modification may be enhanced, for example, by 
treating a target protein with a phosphatase to remove phosphorylation-sites and then beta- 
eliminating to get 0-GlcNAC sites, etc. O-sulfo-serine and -threoine sites can be similarly 
beta-eliminated and labeled. 

For example, Wells, et al. (2002, Mol Cell Proteomics 1(10), 791-804) describe beta- 

25 elimination/Michael addition method protocols for mapping sites of O-modification using 
affinity tags for serine and threonine post-translational modifications, including 
modifications that rely on mild beta-elimination followed by Michael addition with reagents 
such as dithiothreitol. The reference also recites methods for using synthetic peptides wherein 
biotin pentylamine replaces dithiothreitol as the nucleophile. The modified peptides can be 

30 efficiently enriched by affinity chromatography, and the sites can be mapped using tandem 
mass spectrometry. This same methodology can be appUed to mapping sites of serine and 
threonine sulfonation, and provides a strategy that uses modification-specific antibodies and 
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enzymes to discriminate between sulfonation, phosphorylation and GlcNAC O-serine and O- 
threonine post-translational modifications. 

Proteins and peptides containing or modified to contain predetermined O-serine or O- 
threonine sulfonation provide a wide variety of industrial uses, including immunogens for 
5 eliciting specific antibodies, antigens for detecting such antibodies or for use in affinity 
binding pairs with such antibodies, labels or affinity reagents for binding sulfatases and 
sulfotransferases, competitive inhibitors of sulfo-protein specific interaction etc. For 
example, O-serine sulfonated peptides can be shown to inhibit native sulfonation-dependent 
protein-protein interactions using methods analogous to those of Roos et al. 1997, Mol Cell 

10 Biol, 17, 6472-80. 

The invention also provides kits specifically tailored to practicing the subject 
methods. For example, in one embodiment, the kits comprise one or more materials for 
detecting or modifying a preferably predetermined O-sulfonation of serine or threonine of a 
preferably predetermined protein and an associated instructional medium describing a subject 

15 method. 

The invention also provides business methods specifically tailored to practicing the 
subject methods. For example, in one embodiment, the business methods comprise selling, 
contracting, or licensing a subject method or composition for detecting or modifying a 
preferably predetermined O-sulfonation of serine or threonine of a preferably predetermined 
20 protein. 

EXAMPLES 

I. Identification and isolation of differentially serine and threonine O-sulfonated proteins. 

Here we report the discovery and structural characterization of O-sulfonation of both 
serine and threonine residues in proteins of diverse class and function isolated fi-om 

25 eukaryotes spanning the range fi-om a imicellular parasite to humans. These include a 

neuronal intermediate filament protein fi-om the snail (Lymnaea stagnalis), a cathepsin - C 
like protein fi-om the protazoan malaria parasite {Plasmodium falciparum) and cytoplasmic 
constructs of the human orphan receptor tyrosine kinase, Ror2. The presence of this new 
posttranslational protein modification was detected and characterized by on-line HPLC- 

30 tandem electrospray mass spectrometry fi-om proteins isolated by SDS-PAGE. 

Proteins isolated firom Lymnaea stagnalis nerve axoplasm were subjected to 2D-gel 
screening. Differentially expressed protein spots were visualized by a mass spectrometric 



SF04-016 



compatible silver stain, excised, digested in-gel with trypsin, and analyzed by LC-CDDMS 
(see below). Tryptic peptide sequences were deduced from interpretation of the collision- 
induced dissociation (CID) spectra measured [12]. Our Bogyo malaria protein isolation 
protocol has been previously described [14]. 

The transmembrane and cytoplasmic domains of human Ror2 were amplified by RT- 
PCR from total RNA isolated from human SH-SY5Y cells. The cDNA, encompassing 
residues 427-943 with an Xhol site in place of the stop codon, was subcloned into pcDNA6- 
Myc/His-A (Livitrogen) to add a carboxyl-terminal Myc/His tag resulting in the plasmid 
termed pc6-Ror2cytoMH. To target the Ror2 construct to the inner surface of the membrane, 
the chicken c-Src myristylation signal was added to the amino terminus starting at residue 
432 using the imique SgrAI site within the myristylation sequence, creating the pc6- 
myrRor2cytoMH vector. Residues 749-943 were deleted from the construct by generating a 
PGR fragment with an Xhol site after residue 748 to create the pc6-myrRor2DMH vector. 

Human embryonic kidney 293T cells were cultured in Dulbecco's Modified Eagle's 
Medium containing 10% fetal bovine serum and 1% penicillin/streptomycin solution 
(Livitrogen). Approximately 800,000 cells were seeded per 60 mm dish and transiently 
transfected using LipofectAMINETM 2000 (Livitrogen) according to the manufacturer's 
recommended protocol. Cells were incubated for 48 hours prior to harvesting. Cells were 
washed twice with ice-cold phosphate-buffered saline and scraped into 750 mL of the same 
solution. Cells were pelleted by centrifugation at -5,000 x g for 1 min and supernatant was 
removed by aspiration. Cells were lysed in a protein extraction buffer (PEB) and cleared by 
centrifiigation at -14,000 x g for 10 min at 4°C. Myc/His tagged proteins were 
immimoprecipitated by incubating 1 mg of each sample with 15 mL of agarose-conjugated 
anti-myc antibody overnight at 4°C. Agarose beads were washed twice with PEB and once in 
a kinase buffer. Beads were then incubated in kinase buffer containing 100 mM ATP for 30 
min at 30°C. Reactions were stopped by the addition of 5 mL 6X SDS-PAGE loading buffer. 

The digests and synthetic peptides were analyzed by capillary HPLC-ESI- 
QqoaTOFMS using an Ultimate HPLC system equipped with a FAMOS autosampler and a 
CI 8 PepMap 75mmxl50 mm column (Dionex-LC-Packings, San Francisco). Solvent A was 
0.1% formic acid in water, and B was 0.1% formic acid in acetonitrile, at a flow rate of -350 
nL/min. Approximately 1/10 of each digest was injected at 5% B, then the organic content of 
the mobile phase was increased linearly to 50% over 30 min. The column effluent was 
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directed to a QSTAR Pulsar tandem mass spectrometer (Applied Biosystems/MDS Sciex, 
Toronto, CA). During the elution of the peptides 1 sec MS acquisitions were followed by 5 
sec CID experiments for computer-selected precursor ions in information-dependent 
acquisition mode. The collision energy was set according to the mass value and charge state 
of the precursor ion. The CID spectra were interpreted manually. 

Protein class and function assessment was carried out with the aid of a variety of 
bioinformatics including database homology search engines [16] such as MS-Pattem and 
MS-BLAST. 

We conducted 2D-gel analyses of neuronal axoplasm from the freshwater snail 
Lymnaea stagnalis; proteins differentially expressed in nerve regeneration were subjected to 
in-gel tryptic digestion followed by reverse phase chromatographic separation and mass 
spectral analysis of the digest mixture. Peptide sequences were deduced by interpretation of 
their collision-induced dissociation (CID) mass spectra, and used to query the NCBI 
nonredundant protein database to assign tentative homologies in sequenced genomes. This 
effort revealed a number of protein spots with extensive homology to intermediate filament 
proteins from other moUusks, including i/eto [18] andAplysia [19]. 

During the course of carrying out de novo sequencing of these particular proteins, a 
number of digest components were discovered during an LC-ESIMS experiment that 
displayed identical mass values for their entire CID sequence ion series (viz. identical 
fragmentation pattems), but eluted with significantly different chromatographic retention 
times (deltaTR= l-5min) and possessed different molecular weights. In fact, all of the later 
eluting components of these identical fragmentation pattem pairs displayed an 80 Da 
increment in their measured molecular weight. Analysis of these LC-CIDMS spectra 
revealed eight such tryptic peptides in one particular digest. The chromatographic ion 
extraction profiles corresponding to the mass values of the modified peptides and their 
unmodified counterparts were retrieved from the entire digest LC/MS data set. 

These data indicate analogs having identical peptide sequences modified by a moiety 
of AM = 80 Da. From the nominal mass spectrum, these modified analogs are stable during 
electrospray ionization under ambient conditions. This behavior indicated the modification 
was a sulfo-moiety, rather than the isobaric phospho-modification, consistent with results 
from protein phosphorylation studies [20-27]. 

Therefore, in order to establish that the observed modifications were due to O- 
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sulfonation, a series of sulfo- and corresponding phospho-peptides was synthesized. For 
example, to compare the chromatographic retention and mass spectral fragmentation 
behavior, the sequence LAGLQDEIGSLR (SEQ ID NO:06) was synthesized as such together 
with its phospho- and sulfoseryl- analogs. These synthetic peptides were studied by 
5 LC/MS/MS under similar conditions to those employed during analyses of the original gel 
plug digests. The retention time observed for the sulfo-modified peptide was later than its 
unmodified counterpart noted in the original experiment vida supra. Interpretation of the 
resuhs from these experiments including CID spectra established that the sulfo- peptides do 
eliminate the modification upon deposition of sufficient vibronic energy to induce 

10 dissociation of the peptide backbone bonds. They produce low energy CID mass spectra 

virtually identical to the corresponding unmodified molecules, but the phospho-serine analog 
undergoes only partial b-ehmination [20-26]. Finally, we have observed that sulfo-threonyl 
peptides behave similarly to their sulfo-seryl analog molecules. 

Plasmodium falciparum and human cell culture: In addition to the fireshwater snail, 

15 similar experiments on digests derived from proteins isolated from both the malaria parasite, 
Plasmodium falciparum, and from human embryonic kidney 293T cells revealed peptide 
analogs that are covalently modified on both serine and threonine residues. 

In the case oi Plasmodium falciparum, one of the proteins was detected using a 
suicidal substrate probe for cysteine protease activity [14]. It was identified as a remote 

20 homolog of cathepsin C based on de novo sequences obtained from tandem mass 

spectrometry. One of its tryptic peptides has been shown by analogous CID analysis to be 7- 
O-sulfono-RIEVALTK (SEQ ID NO:09). This finding was confirmed by synthesis of this 
peptide and its modified analogs and subsequent measurements that showed comparable 
mass spectral behavior. 

25 In addition, during mass spectrometric characterization of the myristyl 

juxtamembrane construct of the human tyrosine kinase receptor Ror2, the peptide 
^^'LKEISLSAVR^'^ (SEQ ID NO: 10) was observed together with two other 
chromatographically distinct peaks. CID analyses of each of these produced fragmentation 
spectra demonstrative of sulfonation. These results indicate that this sequence exists modified 

30 at either of the two serine residues available for sulfonation. 

The fragmentation behavior of phospho-peptides has been studied extensively for 
over a decade. It is well known that p-elimination of the elements of phosphoric acid (-98 
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Da) from Ser- and Thr-modified peptides occurs as a favored dissociation process [20-26]. 
However, quantitative elimination of phospho-functions does not occur either during 
electrospray ionization or under low energy coUisional activation. Thus far the only 
observation of significant gas-phase dephosphorylation (-80Da) during electrospray 
ionization involved the highly reactive phospho-histidine species [28]. Therefore the 
observations described above are diagnostic for the presence of a sulfo moiety, rather than 
the corresponding phospho-analog. Extensive previous studies have shown that Tyr-sulfated 
peptides usually display some sulfate loss even during MS acquisition [29]. 

Our identification of serine/threonine sulfonations in proteins from very different 
organisms demonstrates that this modification is widespread, occurring ubiquitously in 
eukaryotes. Moreover, the exemplified modified proteins we report are targeted to three 
distinct cell compartments, cytoplasm, lysosome, and plasma membrane, demonstrating that 
serine/threonine sulfonation occurs both in the ER continuum and the cytoplasm. Our studies 
also revealed a large number of proteins to be differentially sulfonated across differing 
physiological conditions, including injured versus non-injured axoplasms (Table 1). 



Table 1 . Differentially serine and threonine sulfonated proteins, including 
intermediate filament proteins. 



1 


upregulated 


lEF 3-10 


MW~26 kDa, pl~4 


2 


upregulated 


BEF 4.5-5.5 


MW~26 kDa; pi - 5 


3 


upregulated 


ffiF 4.5-5.5 


MW~40 kDa, pl~4.6 


5 


upregulated 


lEF 4.5-5.5 


MW~110kDa, pi -5.3 


6 


upregulated 


lEF 4.5-5.5 


MW~108 kDa, pi- 5.4 


7 


downregulated 


lEF 4.5-5.5 


MW~99 kDa, pi 5.2 


8 


downregulated 


ffiF 4.5-5.5 


MW~99 kDa, pi 5.3 


9 


upregulated 


ffiF 4-7 


MW~26 kDa, pi 4.5 


13 


upregulated 


ffiF 4-7-5 


high MW, pi 


15 


upregulated 


ffiF 4-7 


high MW, pl>5 


120 


downregulated 


ffiF 4-7 


contains sulfopeptides with 
similarity to synaptotagmin. 
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Table 2. Differentially serine and threonine sulfonated peptides identified from CID in 



proteins 5, 6, 7, 8 and 9 



5A 


KVIDELAS SK [ 1 3 1 - 140] 


rSEO DD NO OD 


5B 


NAAYAELATR [340-349] 


fSEO ID NO 02'k 


5C 


YASQLNQLR [304-312] 


fSEO ID NO 03'i 


5D 


TLVEQAIGTQSK [428-439] 


rSEO ID NO 04'i 




oMol'LrV YV^V^J^oooLrll JJrK [47- 


(SEQ ID NO:05) 


6B 


661 KVIDELASSK ri3 1-1401 






T AHT ODFTGSIT 1^ Fl 47-1 SRI 




6D 


NAAYAELATR [340-349] 


(SEQ ID NO:02) 


6E 


YASQLNQLR [304-312] 


(SEQ ID NO:03) 


6F 


TLVEQAIGTQSK [428-439] 


(SEQ ID NO: 04) 


7A 


KVIDELASSK [131-140] 


rsEO ID NOon 


7B 


NAAYAELATR [340-349] 


fSEO ID NO 02^ 


8 A 




(oHQ ID JNUrUI) 


8B 


LAGLQDEIGSLR [147-158] 


fSEO ID NO 061 


8C 


NAAYAELATR [340-349] 


fSEO ID NO02'> 


8D 


YASQLNQLR [304-312] 


fSEO ID NO031 


8E 


VGLRTLVEQAIGTQSK [424-439] 


fSEO ID NO 071 


8F 


TLVEQAIGTQSK [428-439] 


rSEO ID NO 041 


OA 




(oiil^ UJ JNU.U!>) 




DO J JV V i±JiLL^l\\D O JV [ 1 ^ 1 - i H-U J 


^cur^ TT^ XT#^.ni \ 

(oxil^ JNUlUlj 






(oiiV^ iU JNU:Uoj 


9D 


ELIVTYESQAK [159-169] 


(SEQ ID NO:08) 


9E 


NAAYAELATR [340-349] 


(SEQ ID NO:02) 


9F 


YASQLNQLR [304-312] 


(SEQ ID NO:03) 


9G 


VGLRTLVEQAIGTQSK [424-439] 


(SEQ ID NO:07) 


9H 


TLVEQAIGTQSK [428-439] 


(SEQ ID NO:04) 



n. Preparation of high-affinity single chain sulfopeptide specific anitibodies. 

Nonimmune phage antibody libraries were constructed for the production of high- 
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affinity human single-chain antibodies (Sheets et aL, 1998, PNAS USA 95, 6157-62). Total 
RNA was prepared from three different samples of human spleen cells and from two 
different samples of human peripheral blood lymphocytes. cDNA was synthesized from total 
RNA primed with the HuIgMFOR primer (Marks, et al, 1991, J. Mol. Biol. 222, 581-597). 
Vh gene repertoires were amplified from the cDNA by using Vent DNA polymerase (New 
England Biolabs) in combination with the HuIgMFOR primer and an equimolar mixture of 
HuVHBACK primers. PGR products were agarose gel-purified and reamplified to append 
Ncol and Notl restriction sites by using Tth DNA polymerase (Epicentre Technologies, 
Madison, WI) and an equimolar mixture of the HuVHBACKSfi primers (that contain an Ncol 
site for cloning) and the HuCMForNot primer. The PGR products were cut with restriction 
enzymes Ncol and Notl and agarose gel-purified. The resulting DNA fragments were ligated 
into the plasmid pGITE3 A (Novagen) cut with restriction enzymes Ncol and Notl and the 
ligated DNA was electroporated into the E. coli strain TGI. A library of genes containing 
2.3 X 10^ members was generated from the products of seven ligation reactions and 15 
electroporations. The resulting library was termed pGITE-Vh- Gloning efficiency and library 
diversity was determined by PGR screening. The pGITE3 A plasmid was used to create the 
Vh gene repertoire because of the presence of unique sequences for PGR amplification that 
surround the Ncol and Notl cloning sites. These sequences allow the specific amplification of 
the Vh genes for scFv assembly. 

Gonstruction of the scFv Library. The Vh gene repertoire was PGR-amplified from 
the pGITE-Vh library by using 300 ng of library plasmid DNA as a template. Vent DNA 
polymerase, the CITE3 primer, and an equimolar mixture of HuJh primers. The Vl genes for 
scFv assembly were obtained from a previously constructed scFv phage antibody library 
(Marks et al., supra). The Vl gene repertoire, including DNA encoding the scFv peptide 
linker (648)3 (Huston, et al., 1988, PNAS USA 85, 5879-5883 ), was amplified from 300 ng 
of Kbrary plasmid DNA by using Vent DNA polymerase, the Gene3 primer, and an 
equimolar mixture of RHuJh primers. The amplified Vh and Vl genes were agarose gel- 
purified and spliced together with overlap extension PGR to create a scFv gene repertoire 
(Glackson, et aL, 1991, Nature, London, 352, 624-628). To accurately join Vh and Vl gene 
repertoires with overlap extension PGR, the input DNA firagments must have blunt ends. 
Therefore, the proofi-eading DNA polymerase Vent was used to generate the Vh and Vl DNA 
fi-agments for scFv assembly. For all subsequent PGR steps of library construction Tth DNA 
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polymerase was found to be the optimal enzyme. The V„ and Vl gene repertoires were 
spliced together in 100- |il PCRs containing 100 ng of the and Vl DNA fragments and Tth 
DNA polymerase. The reactions were cycled eight times (95''C 2 min, 55°C, 1 min, and 72°C 
3 min) to join the fragments. Then the CITES and Gene3 primers were added and the reaction 
5 was cycled 30 times (94°C 1 min, 55°C 1 min, and 72°C 3 min) to amplify the assembled 
scFv genes. The scFv genes were cut with restriction enzymes Ncol and Notl, agarose gel- 
purified, and ligated into the plasmid pHEN-1 (Hoogenboom, et al., 1991, Nucleic Acids 
Res. 19, 4133-4137) cut with Ncol and NotL The Ugated DNA was electroporated into E. coli 
TGI cells. 

10 Selection of Phage Antibodies. Phagemid particles were rescued from the library, as 

described (Schier, et al., 1996, J. Mol. Biol. 255, 28-43) except that the procedure was scaled 
up to 2 liters of culture media. Specific phage-displayed scFv were affinity-selected by using 
proteins absorbed to Lnmunotubes (Nimc). For selection of scFv to the sulfopeptides of 
Table 3, Lnmunotube selection is altemated with selection using decreasing concentrations of 

15 biotinylated sulfopeptides and capture of bound phage using streptavidin paramagnetic beads. 
Phage eluted from each selection are used to infect E. coli TGI cells. Phage particles are 
rescued from the cells and used for the subsequent round of antigen selection. The rescue- 
selection-plating cycle is repeated 3-4 times, after which individual clones are analyzed for 
specific antigen binding by ELISA. 

20 Antibody Binding Specificity. Binding specificity of all scFv is confirmed by ELISA 

using the target corresponding antigen sulfopeptide as substrate. Highly specific binding is 
indicated by or greater. 

Table 3. High affinity single chain sulfopeptide specific antibodies 





Phage 


Sulfopeptide 


SEQ ID NO 


scFV specific 


25 


antibody 






binding 
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sT853 


8-O-sulfono-KVTDFT ASISIC 


^oxlV^ UJ JNU.Ul j 


H-+-H- 


sV107 


9-O-sulfono-NAAYAFT ATT? 


(orl/V^ UJ JNvJiUzj 


++++ 


sR437 


3_0-«;ii1fonn-YA9nT TsTOT P 




+++ 


sR032 


9-O-sulfnno-TT VFOAinTO^K' 


UJ JNU.U4J 




sE209 


1 4-0-<;iilfnno- 


ypcjKl UJ InUIUj ) 






kJOXOx VJV X V^Vclji3oOVJrill-/PJV 


(orlrV,; UJ JNUiUo) 






1 n-ri-cnlfrkTio T Am OT^ThT/^CT T? 
1 \/ v^"ouixono-j^/\vjri^v^jjixll0^oijjv 


(iSiiQ UJ NO: 07) 




sB749 


1 l-O-siilfonn-VrrT RTT VFnATGTn^l^ 
X V-/ DUixv^iiU' V vjx^xv X 1^ V ll\j£\A.\J y. \J\jr^ 


^o±lV^ UJ iNU.UoJ 


1 1 1 1 


SY512 


8-O-sulfono-ELIVTYESQAK 


(SEQ ID NO:09) 


+++ 


SY268 


7-O-sulfono-RffiVALTK 


(SEQIDNO:10) 


++++ 


SU925 


7-O-sulfono-LKEISLSAVR 







ni. Preparation of high-affinity sulfopeptide specific monoclonal anitibodies. 

A library of peptide conjugates was made by coupling the panel of O-linked serine- 
and threonine-sulfonated peptides of Table 3 with keyhole hmpet hemocyanine (KLH). Our 
coupling protocol is essentially as described by Nishizawa et al. (1991, J Biol Chem 266, 
3074-3079). Corresponding monoclonal antibodies are prepared essentially as described 
previously (Yano, et al. (1994) Biochem. Biophys. Res. Commun. 175, 1 144-1 151). In brief, 
KLH emulsified in complete Freund's adjuvant are injected intraperitoneally into BALB/c 
mice. A booster of the KLH-conjugate emulsion is given to the mice at 2-week intervals. 
Three days after the final boost, the spleen cells are fused with mouse myeloma cell SP2/0- 
Ag 14, using polyethylene glycol 4000. The hybridomas producing anti-sulfopeptide 
antibody were screened by enzyme-linked immunosorbent assay in microtiter plates coated 
with the sulfopeptides. The cells from positive wells are cloned twice to ensure 
monoclonality. In this manner, the hybridoma clone H32 producing AB32 antibody was 
developed. The hybridoma cells are grown as ascites tumors in BALB/c mice primed with 
pristane. Homogeneous IgG fractions are prepared by applying ascites fluid on protein A-gel 
(Bio-Rad) and eluting it, according to the manufacturer's instructions. Immediately after the 
elution, IgG proteins are dialyzed against PBS. 

IV. Mutating sulfonated residues of differentially sulfonated axoplasma proteins. 

Differentially sulfonated axoplasma proteins 5, 6, and 8 (DSAP5, DSAP6 and 
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DSAP8, respectively) are sulfonated on serine/threonine residues 138, 51 and 436, 
respectively (Ser-138, Ser-51 and Thr-436, respectively). This example describes cell lines 
expressing these proteins with sulfonated serine/threonine residues converted to either Glu or 
Phe, and subsequent in situ immunolocalization of the mutant proteins under different axon 
conditions. These experiments reveal that the targeted sulfonation influences cellular 
locahzation of DSAP5, DSAP6 and DSAP8. Experimental protocols were adapted from 
Dong et al.. Biochemistry, 1994; 33(46); 13946-53. 

Polymerase chain reaction-based site-directed mutagenesis is performed using a 
commercial kit (QuickChange™, Stratagene, La Jolla, CA) directly on the mammalian 
expression vector pDX containing DSAP cDNA. By this method, codons for the subject Ser- 
138, Ser-51 and Thr-436 are converted to codons for either phenylalanine or glutamic acid. 
The mutant constructs are sequenced in their entirety to verify targeted mutations. The 
sequencing reaction is performed using a dye terminator kit, and the results analyzed on an 
ABI model 737A automated sequencer (ABI, San Leandro, CA). 

The mutant and wild-type DSAP constructs are cotransfected into PC 12 cells, which 
differentiate in the presence of NGF to a neuron-like phenotype. The plasmid pCDNA3 is 
transfected by electroporation; cotransfection with the DSAP mutants allows for selection of 
mutant-expressing cells by growth under G418 or geneticin; see, e.g. Tcherpakov et al, 2002- 
J. Biol Chem 277: 49101-49104. Transfected cells are first grown in alpha-minimal essential 
medium (a-MEM, Life Technologies, Inc.) without fetal bovine serum for 12-18 h and then 
grown in complete a-MEM medium supplemented with 10% heat-inactivated fetal bovine 
serum (Life Technologies, Inc.). For transient expression, cells are harvested 72 h after 
transfection. To establish stable cell lines, transfected cells are grown in complete a-MEM 
containing 500 |xg/ml hygromycin (Calbiochem). The cells are assayed for DSAP expression 
and localization by staining fixed, membrane-solublized cells in 1 |xg/ml monoclonal 
antibody (supra) and immunostaining with sheep anti-mouse IgG-fluorecein. 

V. Sulfospecific Proteolysis: Chemoenzymatic Mapping Sites of Protein Sulfonation. 

This example employs the selective chemical transformation of sulfoserine and 
sulfothreonine residues into lysine isosteres (aminoethylcysteine and p- 
methylaminoethylcysteine, respectively). Aminoethylcysteine-modified peptides are then 
selectively cleaved with a lysine-specific protease to map sites of sulfonation. A blocking 
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step enables single-site cleavage when desired, and adaptation of this reaction to the solid 
phase facilitates sulfopeptide enrichment and modification in one step. Our strategy reHes on 
the well-established p-elimination of sulfoserine residues to generate dehydroalanine under 
basic conditions (sulfothreonine is converted to p-methyldehydroalanine). Similar chemistry 
has been used to enrich and quantitate phosphoproteins for traditional trypsin digestion and 
MS/MS sequencing. In the next step, dehydroalanine acts as a Michael acceptor for 
cysteamine, generating an aminoethylcysteine residue (for sulfothreonine, P- 
methylaminoethylcysteine is generated). Since aminoethylcysteine is isosteric with lysine, 
proteases that recognize lysine (e.g. trypsin, Lys-C, and lysyl endopeptidase) will cleave 
proteins at this residue. We adapted this strategy firom an analogous approach for mapping 
protein phosphorylation (Knight et al., Nat Biotechnol. 2003 Aug 17, Epub ahead of print). 

A panel of sulfoserine and two sulfothreonine peptides was chosen to demonstrate the 
feasibility of this approach. Extensive peptide degradation results when standard p- 
elimination conditions (~1M hydroxide, 42-55 °C, >1 hour) are applied. To achieve 
quantitative p-elimination without peptide hydrolysis, we use barium hydroxide as the base, 
as well as a specific catalyst for sulfate elimination, at a concentration of 50mM; carry out 
reactions in a previously optimized mixture of DMSO, water, and ethanol; limit the reaction 
length and temperature; and perform the P-elimination and Michael addition steps 
consecutively, such that the addition of cysteamine to the basic reaction mixture in the 
second step reduces the pH of the reaction to - 8. Using this procedure, each peptide is 
cleanly converted into its aminoethylcysteine or p-methylraiinoethylcysteine analogue. 
Digestion of the aminoethylcysteine modified peptides with Lys-C or trypsin liberates 
peptide fi-agments corresponding to selective cleavage at the site of serine sulfonation. 
Hence, site-specific modification combined with proteolytic digestion allows for the 
unambiguous identification of serine and threonine sulfonation sites firom the exact masses of 
the liberated fi-agments. 

We also selected a model protein (the human tyrosine kinase receptor Ror2) that 
contains multiple sites of O-sulfonation to demonstrate this strategy for mapping sulfonation 
sites. Here, the protein is subject to aminoethylcysteine modification followed by co- 
digestion with trypsin or Lys-C. One pmol digested protein is separated by nanoflow liquid 
chromatography on a nano-C18 column and then directly analyzed by online LC-MS, and 
MS/MS on a quadmpole orthogonal TOP spectrometer (QSTAR instrument, PESciex). 
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Peptides are identified by mass fingerprinting (ESI-MS) corresponding to direct cleavage at 
all predicted sulfonation sites of the proteins, as well as by CID data. 

In some cases, it is desirable to obtain cleavage exclusively at the sulfonation site (not 
at lysine residues), generating larger firagments that might provide information about the 
gross topology of sulfonation. For example, the coexistence of unique sulfoisoforms 
(variants of a single protein that contain distinct combinations of sulfonated residues) could 
be investigated by this type of digestion. The structure of such sulfoisoforms is challenging 
to probe by traditional methods, since trypsin digestion intrinsically disconnects information 
about sulfonation sites that are separated by more than 10 to 20 residues (the firequency of a 
lysine or arginine residue). Alternatively, cleavage exclusively at sulfonation sites facilitates 
sulfonation mapping by N-terminal Edman degradation, since the first residues sequenced are 
those directly C-terminal to the site of sulfonation. 

The MARCKS substrate, a 25 residue peptide containing 12 lysine and modified to 
contain three sulfoserine residues, v^as selected to explore the feasibility of achieving 
exclusive cleavage at sulfonation sites. To do this, we first convert the lysine residues to 
homoarginine using o-methylisourea in order to block digestion at those sites with Lys-C. hi 
addition to blocking proteolytic digestion, this modification has several practical advantages, 
including (I) the enhancement of the ionization of homoarginine containing peptides in 
MALDI due to the introduction of the more basic guanidinium moiety, (ii) the elimination of 
the near mass degeneracy of lysine and glutamine, simplifying database searching based on 
mass fingerprinting, (iii) the retention of the positive charge of the lysine modified peptides 
and proteins, improving solubility, and (iv) reaction conditions that can facilitate the nearly 
quantitative (90-99%) guanidination of the lysine residues in fiiU-length proteins. 

The MARCKS substrate is guanidinated, the sulfoserine residues fiirther converted to 
aminoethylcysteine, digested with Lys-C, and finally subjected to mass analysis by MALDI. 
The MALDI mass spectrum fi*om the digest exhibits eight prominent peaks corresponding to 
eight of nine possible combinations of cleavage at the three sulfonation sites. The smallest 
fi-agment, corresponding to cleavage at aminoethylcysteine residues 8 and 12 (m/z = 652.4), 
requires longer digestion times and higher concentrations of protease to detect. No other 
major products are observed, confirming that homoarginine is not a substrate for Lys-C. 
Alternative chemistries are available for efficiently blocking lysine residues, and similar 
results are obtained with the MARCKS substrate by acetylating the lysines residues. 
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We also sought to couple aminoethylcysteine modification directly to an approach for 
sulfopeptide enrichment. For this purpose, we adapt the aminoethylcysteine reaction to a 
solid phase catch and release strategy to provide one step modijflcation and enrichment of 
sulfopeptides. This strategy allows sulfopeptides to be captured in the first step through an 
irreversible reaction with the dehydroalanine moiety. Li the next step, a different bond in the 
bead linker is labiUzed such that peptides are released while simultaneously unmasking a new 
chemical handle (aminoethylcysteine) for subsequent enzymatic interrogation. This type of 
approach facilitates automation and offers advantages over similar enrichment approaches 
that rely on selective biotinylation, which has been observed to complicate MS spectra. 

To prepare an appropriate solid phase reagent, a polyethyleneglycol-polystyrene 
(PEG-PS) copolymer base resin (TentaGel AC) is loaded with cystamine as the benzyl 
carbamate. This design incorporates two important features that facilitate 
aminoethylcysteine modification. First, the PEG-PS resin swells in both organic and aqueous 
solvents, allowing resin capture to be performed under conditions that have been optimized 
and vaUdated for the solution phase chemistry. Secondly, the methoxybenzyl carbamate 
linkage is stable to the basic conditions of the p-elimination reaction, allowing for efficient 
peptide capture, but highly acid labile, facilitating aminoethylcysteine peptide release by 
brief treatment with trifluoroacetic acid (TFA). 

We test the ability of this reagent to capture sulfopeptides and release them as the 
aminoethylcysteine derivative. Two non-sulfonated peptides, one sulfonated threonine 
peptide, and one sulfoserine peptide are mixed and added to the resin as an approximately 
equimolar mixture. After incubation with the resin under P-elimination conditions for one 
hour, the flow-through is analyzed by HPLC. The non-sulfonated peptides are detected 
intact, but the sulfoserine and sulfothreonine peaks are absent, consistent with selective 
capture of the sulfopeptides in high yield. Brief treatment with TFA releases the 
sulfopeptides as the aminoethylcysteine modified diastereomer pair, suitable for enzymatic 
sulfonation mapping. This approach allows proteolytic sulfonation site-mapping to be 
directly coupled to sulfopeptide enrichment from complex mixtures. 

hi summary, we describe here an approach for mapping protein sulfonation by direct 
enzymatic cleavage of polypeptides at the site of post-translational modification. This 
strategy is also applicable to mapping protein phosphorylation and glycosylation in an 
analogous fashion. Methods for distinguishing between these various O-modifications are 
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well-established, such as pretreatment with an appropriate glycosidase or phosphatase. 
Details of the experimental protocols used in this example are as follows: 
Sulfonation Mapping of Sulfoserine and Sulfothreonine Peptides. Model sulfoserine 
peptides (ca. 100 |xg) are dissolved in a 4:3:1 solution of H20:DMSO:EtOH (50 |il). The p- 
elimination solution (50 mM Ba(OH)2 solution (23 ^1) is added, and the reaction is incubated 
at 37" C. After 1 hour, a IM solution of cysteamine in H2O (50 |il) is added directly to this 
reaction and the reaction incubated an additional hour at room temperature. Reactions are 
analyzed by dilution into 1 ml H20/0.1% TFA and separation of the reaction products by 
reverse phase HPLC on a Dynamax SD-200 solvent delivery system (Rainin, Wobum, MA) 
equipped with a C-18 colimm. Individual fractions are analyzed by ESI-MS offline using a 
Micromass ZQ (Waters, Milford, MA) or by MALDI-MS and ESI-MS/MS. (below). Beta- 
elimination was also performed as follows: to 2 ul peptide solutions (1 pM/ul in water) we 
added 2 ul of 25mM Ba(OH)2, incubated for 1 hr at ZT C, added 2 ul of 25 mM ammonium 
sulfate, centrifiiged it, took 1 ul of the supematant, mid measured it by MALDI in DHB 
without further purification. 

For site-mapping, modified peptides are reconstituted in either 10 mM Tris, pH 8.5 
(Trypsin) or 10 mM Tris, pH 8.5, 1 mM EDTA (Lys-C) and digested for 4 hrs at ZT C. 
Reactions are analyzed as above. For FRET monitoring of the Lys-C digestion of 
diastereomeric aminoethylcysteine peptides, peptide diastereomers (- 5 p.g) are separated by 
HPLC, and digested with 5 [xg Trypsin. Reaction progress is monitored as emission at 420 
nm following excitation at 320 nm in a SpectraMax GeminiXS fluorescence plate reader 
(Molecular Devices, Sunnyvale, CA) as described. 

For guanidination reactions, the MARCKS substrate or p-casein was dissolved in 0.5 
M O-methylisourea, pH 10.5 and incubated overnight at 37° C. For acetylation reactions, the 
MARCKS substrate was dissolved in 100 mM NaHC03, pH 8.5 and treated with 
approximately 100 equivalents of sulphosuccinimidyl acetate (Pierce, Rockford, IL) for 2 
hours at room temperature to quantitatively acetylate lysine residues. Reactions were 
desalted by HPLC or dialysis and subjected to aminoethylcysteine modification and Lys-C 
digestion as above. 

Mass spectra are obtained by matrix-assisted laser desorption ionization time-of-flight 
(MALDI-TOF) mass spectrometry on a Voyager DESTR (Applied Biosystems, Framingham, 
MA). All mass spectra are acquired in positive-ionization mode with reflectron optics. The 
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instrument is equipped with a 337 nm nitrogen laser and operated under delayed extraction 
conditions in reflectron mode; a delay time of 200 nsec, and grid voltage 72.5% of full 
acceleration voltage (20 kV); the guide wire voltage is at 0.002%. For linear mode 
experiments, the delay time is 85 nsec and the grid voltage 94% of the acceleration voltage; 
the guide wire voltage is at 0.05%. Prior to MALDI-MS analysis, the proteolytic reaction 
mixtures are desalted with reversed-phase Zip Tipscg (CAS resin, Millipore, Bedford, MA). 
All peptide samples are prepared using a a-cyano-4-hydroxycinnamic acid (HCCA) (Agilent 
Technologies) matrix solution or 2,5-dihydroxy benzoic acid, saturated solution in water; 1 
[iL of analyte (0.1-1 pmol of material) is mixed with 1 of matrix solution, and then air- 
dried at room temperature on a stainless steel target. Typically, 200 laser shots are used to 
record each spectrum. The obtained mass spectra are extemally calibrated with an equimolar 
mixture of angiotensin I, ACTH 1-17, ACTH 18-39, and ACTH 7-38. 

Sulfonation Mapping of Ror2. Human Ror2 protein (ca. 100 |Lig) is modified using 
the same conditions as described for peptides. Following aminoethylcysteine modification, 
the protein is transferred to the digestion buffer by gel filtration (PD-10, Amersham 
Biosciences, Piscataway, NJ). Sequential digests with Trypsin or Lys-C are carried out at 37 
°C for approximately 6 hours each using ca. 1/10 enzyme by weight. 

For manipulation of smaller sample sizes (50 ng to 4 [ag), several modifications were 
made to the above protocol. Aminoethylcysteine modification is carried out in 0.5 ml 
microcentrifiige tubes and the sample volume is adjusted to maintain a protein concentration 
greater than or equal to 0.01 |ig/|il. When the reaction is complete, reagents are removed by 
dialysis overnight against 1 liter of 20 mM Tris, pH 8.0 using 10000 MWCO Slide-A-Lyzer 
Mini Dialysis units (Pierce, Rockford, XL). Dialyzed samples are transferred to a new 0.5 ml 
microcentrifuge tube, and the dialysis membrane is washed three times with 10 |il of 20 mM 
Tris, pH 8.0. Samples are concentrated to ~ 5 p.1 by Speedvac and 5 |il of acetonitrile added 
as a denaturant. The sample mixture is heated to 65" C for 10 minutes and then the digestion 
is initiated by the addition of 15 |al of 10 mM Tris, pH 8.0 containing trypsin at 1/10 enzyme 
to substrate by weight. Reactions are allowed to proceed approximately 6 hours at 37° C. 

The proteolytic peptide mixtures (ca. 1 pmol) are analyzed by MALDI-MS or by 
reversed-phase HPLC-MS/MS. Briefly, peptides are separated on an Ultimate nanocapillary 
HPLC system equipped with a PepMap™ C18 nano-column (75 jxm LD. x 15 cm) (LC 
Packings, Sunnyvale, CA) and CapTrap Micro guard colunm (0.5 ^il bed volume, Michrom, 
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Auburn, CA). Peptide mixtures are loaded onto the guard column and washed with the 
loading solvent (HjO/O.OS % formic acid, flow rate: 20 fil/min) for 5 min to remove salts and 
denaturing reagents, then transferred onto the C18-nanocapillary HPLC column and eluted at 
a flow rate of 300 nl/min using the following gradient: 2% B (from 0-5 min), and 2-70% B 
(from 5-55 min). Solvent A consists of 0.05 % formic acid in 98% H2O/2 % ACN and 
solvent B consists of 0.05% formic acid in 98% ACN/2% H2O. The column eluant is directly 
coupled to QSTAR quadrupole orthogonal TOF mass spectrometer (MDS Sciex, Concorde, 
Canada) equipped with a MicroIonSpray source (MDS Sciex, Concorde, Canada). The 
needle voltage is typically 2300 V in the HPLC-MS mode. Mass spectra (ESI-MS) and 
tandem mass spectra (ESI-MS/MS) are recorded in positive-ion mode with a resolution of 
10,000 FWHM. For coUision induced dissociation tandem mass spectrometry (CID-MS/MS), 
the mass window for precursor ion selection of the quadrupole mass analyzer is set to ± 3 
mass imit. The precursor ions are fragmented in a coUision cell using nitrogen as the 
collision gas. The LC-MS runs on the QSTAR instrument are acquired in "hiformation 
Dependent Acquisition" mode, which allows the user to acquire MS/MS spectra based on an 
inclusion/exclusion mass list/charge states and dysnamic assessment of relative ion 
intensities. The instrument is calibrated by infusing a renin peptide standard and using its 
MS/MS fragment-ions (His immonium-ion at m/z at 1 10.0713, and bg-ion at m/z at 
1028.5312) providing a mass accuracy of <50 ppm. 

Resin Synthesis. Tentagel AC resin (5 g) is swelled in anhydrous THF (75 ml) at 
room temperature under an inert atmosphere. 1,1 carbonyldiimidazole (2.5 g) is added and 
stirred for 3 hours. The resin is filtered, washed with THF, EtsO, and dried in vacuo 
overnight. Before use, cystamine HCl salt (5 g) is dissolved in H2O (45 ml), the pH is 
adjusted to 12 with NaOH, and the cystamine is extracted with CHjClj. The organic phase is 
dried with MgS04, filtered, and the solvent removed in vacuo to give a clear oil. This oil (ca. 
1 g) is added to the activated resin (2 g) swelled in THF (25 ml). N-methylmorpholine (2 ml) 
is added and the resin is heated to 60" C for 4 - 6 hours under an inert atmosphere. The resin 
is filtered, washed with THF, Et20, dried in vacuo, and stored at - 20" C. Immediately 
before use, the resin is deprotected by brief treatment (15 min.) with 100 mM DTT in HjO to 
expose the cysteamine thiol Quantitation of resin loading with EUman's reagent typically 
demonstrates 60 - 80% loading (0.20 to 0.25 mmol/g). 

SoHd-Phase Capture and Modification of Sulfoserine Peptides. Following 
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deprotection, the resin is washed with 5 times with HjO and 5 times with 4:3:1 
H20:DMSO:EtOH. Peptides are dissolved in 4:3:1 H20:DMSO:EtOH (250 |al) and added to 
80 mg of resin swollen in the same. 50mM Ba(OH)2 (225 |il) is added and the reaction is 
incubated for one hoxir at 37" C. The resin is then rinsed successively with HjO, DMF, 
5 CH2CI2 and Et20 and dried ovemight in vacuo. To release the peptides, the dried resin is 
suspended in 95:2.5:2.5 TFA:Me2S:H20 (1 ml) for 15 minutes at room temperature. The 
resin is then filtered, washed 3 times with TFA (1 ml), and the filtrate is concentrated in 
vacuo. The released peptides are taken up in H20/0.1% TFA and analyzed by HPLC (abs. 
215 nm) and MS. 

10 Supplementary Aminoethylcysteine Modification Protocol. Protein samples for 

aminoethylcysteine modification are first desaUed ovemight by microdialysis against 2 liters 
water using 10000 MWCO SUde-A-Lyzer Mini Dialysis units. Before use, dialysis units are 
soaked for one hour in water and the inner membrane is rinsed three times with 100 [il water. 
Following dialysis, dialyzed samples are transferred to 0.5 ml eppendorf tubes, and the 

15 dialysis membrane is washed three times with 20 ^l water. The combined dialyate is then 
concentrated by Speed- Vac to reduce volume to -5 |xl, with care not to concentrate to 
dryness. 5 jul of a 3:1 mixture of DMSO:EtOH is added directly to this sample. P- 
elimination is initiated by the addition of 4,6 ^1 sat. Ba(OH)2 and 1 |xl NaOH. For most 
proteins, a 2 hour incubation in a 37'' C water bath is recommended. At this stage, solutions 

20 of some fiiU-length proteins may appear somewhat heterogeneous; this has no effect on the 
efficiency of the reaction, but gentle vortexing every 20 or 30 minutes is recommended to 
prevent excessive aggregation. After two hours, the sample is placed at room temperature. 
While the sample is cooHng (5-10 minutes), a 1 M solution of cysteamine HCl is fireshly 
prepared, and 10 |il of this solution is added directly to the p-elimination reaction. This 

25 reaction is allowed to proceed 3 to 6 hours at room temperature. 

When the Michael addition reaction is complete, the protein solution is transferred to 
a rinsed mini-dialysis unit and dialyzed ovemight against 2 liters of 20 mM Tris, pH 8.0. 
The eppendorf tube fi-om the b-ehmination reaction is rinsed three times with 15 ml of 20 
mM Tris, pH 8.0 to ensure complete protein transfer. Following dialysis, the protein is 

30 transferred to a new 0.5 ml eppendorf tube, with careful rinsing of the dialysis membrane. 

This protein solution is then concentrated by Speed- Vac. This concentrated protein sample is 
then ready for digestion with appropriate proteases (e.g., trypsin or Lys-C) and analysis by 
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LC-MS/MS or MALDI-MS. In general, we find that Lys-C cleaves at modified sites more 
efficiently than trypsin, and that it is advisable to use slightly higher concentrations of 
proteases than would be optimal for an ordinary trypsin digestion, although optimal digestion 
conditions vary significantly between samples. 
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The foregoing descriptions of particular embodiments and examples are offered by 
way of illustration and not by way of limitation. All publications and patent applications 
cited in this specification and all references cited therein are herein incorporated by reference 
as if each individual publication or patent application or reference were specifically and 
individually indicated to be incorporated by reference. Although the foregoing invention has 
been described in some detail by way of illustration and example for purposes of clarity of 
understanding, it will be readily apparent to those of ordinary skill in the art in Ught of the 
teachings of this invention that certain changes and modifications may be made thereto 
without departing from the spirit or scope of the appended claims. 
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